Wednesday, July 27 • 1:30pm - 1:50pm
Making Data Pipelines in R: A Story From A “Self-Taught” Perspective

When people first learn about R’s capabilities to create fully integrated systems, automated visuals, and seamless data pipelines, reactions can range from disbelief to amazement. R’s expansive capabilities can leave some feeling overwhelmed when tasked with larger projects like data pipelines. This talk invites participants to hear the perspective of a self-taught R user who used curiosity and patience to create a functional data pipeline in R for a local health department. Specifically, this talk will touch on the following concepts:

  • Surveying Data Landscapes
  • File Structures
  • Saving Yourself with Data Validation
  • Modularizing Code and Connecting R Scripts
  • Thinking about Pipeline Sustainability
  • Remaining Calm in Unfamiliar R Territories

Talk materials are available at https://github.com/Meghansaha/pipelines_in_R.

Speakers
Meghan S Harris

PCCTC @ Memorial Sloan Kettering
Meghan Harris is a self-taught R user who is currently a Data Scientist at the PCCTC at Memorial Sloan Kettering Cancer Center. Meghan’s work allows her to work with data and create custom reports, dashboards, and various solutions using the R programming language daily.


Wednesday July 27, 2022 1:30pm - 1:50pm EDT
4. National Harbor 10+11